Amendments to the Claims 

This listing of claims will replace all prior versions, and listings, of claims in the 
application: 

Listing of Claims: 

1. (Original) A method comprising: 

detecting a phone in an off-hook state; 

retrieving with a telephony server information associated with a user assigned to the 

phone; 

generating a custom input grammar with the telephony server using the information; 
generating a dial-tone with the telephony server; 
receiving with the telephony server a command spoken into the phone; 
processing the spoken command with the telephony server to locate a corresponding 
entry in the custom input grammar; and 

executing a command operation associated with the corresponding entry. 

2. (Original) The method of claim 1, wherein the custom input grammar is not generated until 
an identification of a person who spoke the command is performed, and wherein the custom 
input grammar is then generated based on the particular profile of the person. 

3. (Original) The method of claim 1, wherein said processing comprises: 

sending the spoken command to a speech recognition server, said speech recognition 
server processing the spoken command, locating the corresponding entry in the custom input 
grammar, and returning the corresponding entry to the telephony server. 
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4. (Original) The method of claim 3 5 wherein the speech recognition server verifies the identity 
of a person that spoke the command to ensure the person is authorized to access the custom input 
grammar before locating the corresponding entry in the custom input grammar. 

5. (Original) The method of claim 1 5 wherein the custom input grammar is generated from a 
text-based contacts database associated with the user assigned to the phone. 

6. (Original) The method of claim 5, wherein the spoken command is a name of a person in the 
text-based contacts database associated with the user assigned to the phone. 

7. (Original) The method of claim 1, wherein the command is spoken into the phone by a 
person other than the user assigned to the phone. 

8. (Original) The method of claim 1, wherein said generating the custom input grammar is only 
performed if the custom input grammar does not already exist for the user associated with the 
phone or if the custom grammar exists but needs updated due to modifications in an underlying 
data source. 

9. (Original) The method of claim 1, wherein the dial-tone is cancelled when the telephony 
application processor begins receiving the command spoken into the phone. 
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10. (Original) The method of claim 1, wherein said processing the spoken command with the 
telephony application processor comprises: 

sending a recognition request to a speech recognition server; 
receiving a probing request from the speech recognition server; 
sending a UDP probe response message to a probing port number of the speech 
recognition server; 

sending the spoken command to the speech recognition server, said speech recognition 
server determining a translated result based on the custom input grammar; and 
receiving the translated result from the speech recognition server. 

11. (Withdrawn) A method comprising: 

providing a probing endpoint for a first server; 

receiving at a second server a port number of the probing endpoint of the first server; 
receiving at the second server a delivery request for which probing is requested from the 
first server; and 

sending a UDP probe response message to the port number of the first server. 

12. (Withdrawn) A method comprising: 

providing a probing endpoint for a speech recognition server; 
receiving at a telephony server a port number of the probing endpoint of the speech 
recognition server; 

receiving at the telephony server an audio delivery request for which probing is requested 
from the speech recognition server; and 
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sending a UDP probe response message to the port number of the speech recognition 

server. 

13. (Withdrawn) The method of claim 12, further comprising: 

receiving the probe response message at the speech recognition server. 

14. (Withdrawn) The method of claim 13, further comprising: 

reviewing an identifier contained in the probe response message to confirm the probe 
response message was received from the requested telephony server. 

15. (Withdrawn) The method of claim 12, further comprising: 

sending at least one audio packet to the speech recognition server. 

16. (Withdrawn) The method of claim 15, wherein the at least one audio packet sent to the 
speech recognition server was a command spoken into a phone and received by the telephony 
server. 

17. (Withdrawn) The method of claim 12, wherein said receiving at the telephony server the 
port number of the probing endpoint occurs when the telephony server first requests a 
recognition operation from the speech recognition server. 

1 8. (Withdrawn) The method of claim 12, wherein the probing endpoint has a same IP address 
as used for delivering audio to the speech recognition server. 
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19. (Withdrawn) A method comprising: 

providing a probing endpoint for a speech recognition server; and 

sending from the speech recognition server a plurality of probing requests to a telephony 

server until the telephony server sends a UDP probe response message or until a predetermined 

quantity of missed probes has been exceeded. 

20. (Withdrawn) A method comprising: 

providing an audio streaming packet; 

receiving an RTP physical sequence number associated with the streaming audio packet; 
receiving a last logical sequence number that was most recently generated; and 
generating a new logical sequence number by a process comprising: 

adding a fixed-size kilobyte amount to the RTP physical sequence number; 

generating a scale factor by subtracting the fixed-size kilobyte amount from the 
last logical sequence number and masking off from the result a plurality of bits from a 
lowest bit range; and 

adding the scale factor to the RTP physical sequence number. 

21. (Withdrawn) The method of claim 20, further comprising: 

adjusting the new logical sequence number using a revised scale factor if the new logical 
sequence number is not within a certain predetermined range of the last logical sequence 
number. 
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22. (Withdrawn) The method of claim 20, wherein the process repeats for each of a plurality of 
audio streaming packets received. 

23. (Withdrawn) The method of claim 22 5 wherein the new logical sequence number of each of 
the plurality of audio streaming packets is used to reassemble the plurality of packets into a 
logical order. 

24. (Withdrawn) A method comprising: 

allocating an internal buffer list with a plurality of fixed size buffers totaling a maximum 
receive packet size; 

passing the internal buffer list to an operating system as a scatter/gather array; 
filling at least a portion of the plurality of fixed size buffers in order when a packet is 
received; and 

freeing the unused fixed size buffers back to the internal buffer list. 

25. (Withdrawn) The method of claim 24, wherein the packet is a streaming audio packet. 

26. (Withdrawn) The method of claim 25, wherein the streaming audio packet contains 
information to be processed by a speech recognition server. 

27. (Withdrawn) The method of claim 26, wherein the information to be processed by the 
speech recognition server includes a command that was spoken into a phone. 
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28. (Original) A system comprising: 

a speech recognition server; and 

a telephony application server coupled to the speech recognition server over a network, 
the telephony application server being operative to detect a phone in an off-hook state, retrieve 
information associated with a user assigned to the phone, generate a custom input grammar using 
the information, generate a dial-tone, receive a command spoken into the phone, send the spoken 
command to the speech recognition server, receive a corresponding entry based on the custom 
input grammar from the speech recognition server and execute a command operation associated 
with the corresponding entry. 

29. (Original) The system of claim 28, wherein the speech recognition server is operative to 
support a plurality of speech recognition engines. 

30. (Original) The system of claim 28, wherein the speech recognition server is operative to 
send a port number of a probing endpoint to the telephony application server, send a probing 
request to the telephony application server, and receive from the telephony application server a 
UDP probe response message at the port number. 

31. -33. (Canceled) 

34. (Original) A system comprising: 

multiple speech recognition engines residing on one or more speech recognition servers; 

and 
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a telephony server having a telephony application processor operable to translate vendor- 
neutral interfaces to and from a specific syntax required by each of the multiple recognition 
engines. 

35. (Original) The system of claim 34, wherein the telephony application processor is operable 
to perform speaker identification and verification as part of a recognition operation. 

36. (Original) The system of claim 34, wherein the telephony application processor is operable 
to send recognition requests to at least two of the multiple speech recognition engines at the same 
time. 

37. (Original) A method, comprising: 

offering a telephony application interface routine including a voice recognition interface 
operable with multiple speech recognition engines; 

providing the telephony application interface to a first customer having a pre-established 
grammar for a first one of the speech recognition engines; 

the first customer operating the telephony application interface with the pre-established 
grammar of the first one of the speech recognition engines; 

providing the telephony application interface to a second customer having a second one 
of the speech recognition engines; and 

the second customer operating the telephony application interface with the second one of 
the speech recognition engines. 
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38. (Canceled) 



39. (Original) A method comprising: 

detecting a user being connected to a telephony server; 
identifying the user; 

retrieving information associated with the user; 
generating a custom input grammar using the information; 
receiving with the telephony server a command spoken by the user; 
processing the spoken command to locate a corresponding entry in the custom input 
grammar; and 

executing a command operation associated with the corresponding entry. 
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